This work evaluates a few search strategies for Arabic monolingual and cross-lingual retrieval, using the TREC Arabic corpus as the test bed. NIST's release in 2001 of an Arabic corpus of nearly 400k documents, with both monolingual and cross-lingual queries and relevance judgments, has enabled new empirical studies. Experimental results show that spelling normalization and stemming can significantly improve Arabic monolingual retrieval. Character tri-grams from stems improved retrieval modestly on the test corpus, but the improvement is not statistically significant. To further improve retrieval, we propose a novel thesaurus-based technique. Unlike existing approaches to thesaurus-based retrieval, ours formulates word...
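As a rough illustration of two of the preprocessing steps named in the abstract above, the following sketch shows one possible form of spelling normalization and of character tri-gram extraction. The specific normalization rules (stripping diacritics and tatweel, folding alef variants, alef maksura, and teh marbuta) are assumptions drawn from common practice in Arabic retrieval, not rules stated in the abstract, and the function names `normalize` and `char_trigrams` are hypothetical. Stemming is left out, so the tri-gram function expects its input to be a stem already.

```python
import re

# Assumed rules, common in Arabic IR but not specified in the abstract.
_DIACRITICS = re.compile(r"[\u064B-\u0652]")           # fathatan through sukun
_ALEF_VARIANTS = re.compile(r"[\u0622\u0623\u0625]")   # alef with madda / hamza above / hamza below


def normalize(word: str) -> str:
    """Apply a simple, assumed set of Arabic spelling normalizations."""
    word = _DIACRITICS.sub("", word)           # drop short-vowel marks
    word = word.replace("\u0640", "")          # drop tatweel (elongation mark)
    word = _ALEF_VARIANTS.sub("\u0627", word)  # fold alef variants to bare alef
    word = word.replace("\u0649", "\u064A")    # alef maksura -> yeh
    word = word.replace("\u0629", "\u0647")    # teh marbuta -> heh
    return word


def char_trigrams(stem: str) -> list[str]:
    """Return overlapping character tri-grams of a stem.

    Stems shorter than three characters are returned whole so that
    short words still produce an index term.
    """
    if len(stem) < 3:
        return [stem] if stem else []
    return [stem[i:i + 3] for i in range(len(stem) - 2)]
```

In a pipeline of this kind, `normalize` would run on both documents and queries before stemming, and `char_trigrams` would be applied to each resulting stem to produce index terms.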
Although it has been shown that in test collection based studies, stemming improves retrie...
In cross-language information retrieval (CLIR), queries in a source language are used to retrieve re...
Most currently available test collections and almost all CLIR collections have focused upon general-...
Using information retrieval systems to gain access to documents in languages other than English is b...
Arabic, a highly inflected language, requires good stemming for effective information retrieval, yet...
The focus of the experiments reported in this paper was techniques for combining evidence for cross...
The focus of the experiments reported in this paper was techniques for combining evidence for cross-...
In TREC 2002 the Berkeley group participated only in the English-Arabic cross-language retrieval (CL...
Arabic is a major international language, spoken in more than 23 countries, and the lingua franca of...
This paper addresses the optimization of information retrieval in Arabic. Sign...
Sheffield’s participation in the inaugural Arabic cross language track is described here. Our goal w...
This paper describes correction and expansion techniques of multilingual search queries su...
Ten groups participated in the TREC-2001 cross-language information retrieval track, which focussed ...
Although it has been shown that in test collection based studies, stemming improves retrieval effect...